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rrn.K  of  project:  Development  of  a  Medical  Research  Institute  Directed  at  Basic  and 
Clinical  Research  Immunology 

PROJECT  PERIOD:  August  1,  1989  -  July  31,  1992 

GRANT  NUMBER:  DAMD17-89-Z-9042 

PRINCIPAL  INVESTIGATORS:  Robert  E.  Hall,  M.D.,  Ph.D.  (8/1/89-10/1/91),  Lee  A. 
Henderson,  Ph.D.  (10/1/91  -  7/31/92) 

(a)  Front  Qqysl 

(b)  Report  Documentation  (SF298) 

(c)  Foreword 

(d)  Table  of  Contents 
f  e)  INTRODUCTION 

The  immune  system  plays  a  crucial  role  in  defense  against  a  host  of  human  diseases, 
including  bacterial  and  viral  infections,  cancer,  rheumatologic  diseases,  and  immune  deficiency 
diseases,  of  which  AIDS  represents  the  most  important  new  serious  public  health  threat  of  the 
20th  cratury.  A  detailed  understanding  of  Ae  normal  immune  system,  as  well  as  the 
pathophysiology  of  the  abnormal  immune  system,  are  therefore  of  great  importance  to  the 
effective  prevmition  and  treatment  of  a  variety  of  diseases  (including  AIDS)  which  might  be 
encountered  by  military  personnel.  This  project  was  designed  to  expand  and  further  develop  the 
Guthrie  Research  Institute,  a  medical  research  institute  directed  at  basic  and  clinical  research  in 
immunology.  Further,  the  project  was  designed  to  promote  basic/clinical  research  collaborative 
studies  of  human  immunologic  diseases  with  the  Guthrie  Clinic,  with  which  the  Guthrie  Research 
Institute  is  closely  affiliated.  Speciftc  Aims  of  this  project  included  (1)  renovation  of  existing 
space  to  create  three  new  basic  immunology  research  laboratories  and  one  Clinical  Research 
Immunodiagnostic  Laboratory;  (2)  recruit,  equip,  and  provide  start-up  funding  for  the  three  new 
basic  research  laboratories,  at  least  one  of  wMch  will  devote  a  majority  effort  toward  AIDS;  and 
(3)  equip  and  provide  start-up  funding  of  the  Clinical  Research  Immunodiagnostic  Laboratory 
for  the  study  of  immunologic  diseases,  including  HIV-infected  individuals.  In  this  Final  Report, 
we  will  document  how  these  goals  were  accomplished.  We  believe  that  completing  these  goals 
has  contributed  to  building  a  strong  basic  scientific  base  in  immunology  for  the  Army,  which 
should  help  in  better  understanding  and  treating  AIDS  and  other  diseases  of  importance  to  the 
military. 
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(OfiQDX 

Specific  Aiml;  Renovation  of  Existing  Space  to  Create  Three  New  Basic  ItnmunoIo£v  Research 
T^horatnries  and  one  Clinical  Research  Immunodiagnostic  Laboratory  (CRIU. 

During  the  first  project  year,  architectural  plans  were  drawn  up  and  bids  requested  to 
renovate  £q)proximately  8,000  square  feet  of  space  on  the  third  floor  of  the  Guthrie  Research 
Institute  building  and  approximately  400  square  feet  on  the  first  floor  (for  the  CRIL).  Bids  were 
requested  according  to  DOD  guidelines,  the  lowest  bid  accepted,  and  the  renovation  project 
completed.  This  included  an  approximately  1,(XX)  square  foot  state-of-the-art  BL3  containment 
facility  for  use  in  experiments  involving  HIV  or  other  infected  materials.  Similarly,  basic  large 
equipmoit  (centrifuges,  gamma  and  beta  counters,  coldroom,  spectrophotometer,  etc.)  was 
purchased  after  receiving  at  least  three  bids.  The  new  facility  was  opened  on  August  1,  1990. 

Specific  Aim  2:  Recruit.  Equip,  and  Provide  Start-up  Funding  For  the  Three  New  Basic 
Research  Laboratories.  At  Least  One  of  Which  Will  Devote  a  Majority  Effort  Toward  AIDS. 

In  the  8,0(X)  square  feet  of  basic  research  space  on  the  third  floor  of  the  Research 
Building,  the  BL-3,  a  common  equipment  room  and  three  contiguous  research  laboratories  were 
housed.  Three  new  independent  investigators  were  recruited  to  the  Guthrie  Research  Institute. 
Each  laboratory  was  adequately  equipped  and  start-up  supply  funds  provided  for  research 
supplies.  All  laboratories  at  the  Guthrie  Research  Institute,  Including  these  three  new 
laboratories,  are  currently  engaged  in  submitting  proposals  for  additional  future  funding  of 
research  to  Non- Army  sources  of  funding  (including  NIH  and  other  non-governmental  agencies). 
In  addition,  a  renewal  application  to  continue  the  research  started  during  this  project  period  has 
been  submitted  to  the  Army. 

A  brief  description  of  each  new  laboratory  is  provided  below;  a  more  detailed  summary 
of  research  activities  during  the  project  period  is  provided  later  in  the  BODY  of  this  report.  An 
important  goal  accomplished  in  this  project  is  that  these  three  laboratories  collaborate 
significantly  in  the  area  of  HIV  immunobiology.  An  updated  Curriculum  Vitae  for  each  new 
investigator  is  also  provided  in  this  report. 

John  D.  Noti.  Ph.D.  was  a  Research  Associate  in  Dr.  Hall’s  laboratory  (2nd  floor,  research 
^)ace  which  antedated  this  grant)  until  7/1/89  at  which  time  he  was  promoted  to  Assistant 
Scientist  (equivalent  to  Assistant  Professor)  and  moved  into  one  of  the  newly-renovated 
laboratories  on  the  third  floor.  Dr.  Noti’s  research  interests  include  the  molecular  biology  of 
leultocyte  adhesion  molecules,  HTV-leukocyte  receptors,  and  tumor  cell  adhesion  molecules 
involv^  in  goieration  of  metastases. 

Lee  A.  Hoiderson.  Ph.D.  was  an  established  HIV  immunologist  on  the  faculty  of  Tulane 
University  Medical  School  and  joined  the  Guthrie  Research  Institute  on  December  1,  1990,  as 
Director  of  the  BL-3  Facility  and  independent  research  investigator.  In  view  of  his  more  senior 
status.  Dr.  Henderson  actively  advised  the  careers  of  Drs.  Noti  and  Kestler  (below),  and  was 
integral  to  developing  collaborative  efforts  in  HIV  research  among  these  three  labs.  Dr.  Robert 
Hall  was  Scientific  Director  of  the  Guthrie  Research  Institute  and  Principal  Investigator  of  this 
project  until  10/1/92  at  which  point  he  moved  his  laboratory  to  the  University  of  Tennessee. 


3 


Dr.  Henderson  was  the  logical  choice  to  assume  the  role  as  Principal  Investigator  of  this  project, 
and  this  was  proposed  to  and  approved  by  the  Army.  Dr.  Henderson  has  continued  to  actively 
investigate  HIV  virus-leukocyte  receptors,  including  CD4  and  non-CD4  mechanisms,  as  well  as 
the  mechanism  of  viral  infection. 

Daniel  P.  Kestler.  Ph.D.  was  a  Research  Associate  at  Indiana  University  School  of  Medicine 
prior  to  joining  the  Guthrie  Research  Institute  on  9/1/91  as  independent  investigator.  Dr.  Kestler 
is  an  immunologist  and  molecular  biologist  whose  current  research  interests  include  HIV 
immunology,  the  molecular  biology  of  HlV-leukocyte  receptor  interactions,  and  RNA  and  DNA 
tumor  virus  leukemagenesis. 

Summary  of  Research  Results: 

GP41  Interaction  with  nonCD4  proteins  on  Human  Ceils. 

A  gp41  peptide,  CS3,  interacts  with  a  polypeptide  complex  of  200-250  Kd  on  human  cells  that 
minimally  consists  of  two  polypeptides  of  45  and  80  Kd  (p45  and  p80,  respectively).  The  45 
Kd  protein  is  not  detected  on  murine  cells.  Both  CS3  and  carrier  conjugates  of  CSS  (CS3-HSA) 
bind  to  p45  and  p80.  Furthermore,  CS3-HSA  (0.1-10  ug/ml  peptide  or  40  nM)  blocks  HIV 
infection  to  the  extent  of  75-90%  depending  on  the  isolate  of  HIV  used.  Rabbit  polyclonal 
antibody  to  p80  recognizes  both  the  200-250  Kd  complex  which  binds  to  CSS  and 
immunopiecipitates  p80.  Anti-p80  sera  also  blocks  HIV  infection  with  an  ICso  at  a  dilution  of 
1: 1000.  CSS  interacts  with  human  cells  with  high  affinity  binding  to  two  distinct  sites  of  35,000 
and  140,000  molecules  per  cell  and  dissociation  constants  of  10^  and  lO*^  M,  respectively.  Both 
p45  and  p80  have  been  purified  by  affinity  chromatography  or  conventional  means  including  a 
final  step  on  elution  from  a  reverse  phase  (C4)  column  in  the  presence  of  an 
acteonitrile/isopropanol  gradient.  Preliminary  studies  with  a  recombinant  form  of  gp41  show 
that  it  blocks  CSS  binding  with  an  affinity  (pseudoaffinity)  constant  1(X)  fold  greater  than  peptide 
itself. 

The  ability  of  CS3-HSA  and  anti-p80  to  block  HIV  infection  is  either  through  direct  interference 
with  the  ad}ility  of  viral  gp4I  to  interact  with  this  putative  receptor  or  by  altering  the  activation 
state  of  the  cell  in  a  manner  that  renders  cells  refractory  to  HTV  infection.  Future  plans  are  to 
clone  the  graes  which  encode  p45  and  p80,  determine  whether  viral  gp41  directly  interacts  with 
the  p45/p80  complex  and  the  development  of  peptide  analogues  for  mimetics  drug  design  and 
use  in  therapy  for  AIDS. 

Revelant  publications: 

M.  Nasar  (^reshi,  David  H.  Coy,  Robert  F.  Garry  and  Lee  A.  Henderson.  Characterization  of 
a  putative  cellular  receptor  for  HIV  transmembrane  glycoprotein  using  synthetic  peptides.  AIDS 
4:553,  1990. 

William  R.  Gallaher,  Lee  A.  Henderson,  Cesar  Fermin,  Ronald  C.  Montelero,  Angela  Martin, 
M.  Nasar  Qureshi,  Judith  M.  Ball,  Quentin  Sattentau,  Hong  Luo-Zang,  and  Robert  F.  Garry. 
Membrane  Interactions  of  HIV:  Attachment,  Fusion,  and  Cytopathology.  In,  Advances  in 
Membrane  Fluidity  (Eds.  R.C.  Aloia  and  C.C.  Curtain),  6:113-142,  1992. 
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Lee  A.  Henderson,  Nasar  Qureshi,  Daniel  Kestler,  and  John  Noti.  Novel  gp41  binding  proteins 
may  play  a  direct  role  in  HIV-1  infection:  Anti-receptor  antibody  blocks  HIV  infection. 
Manuscript  submitted. 

Lee  A.  Henderson.  Characterization  of  receptor  complex  that  interacts  with  a  peptide  derived 
from  HIV  gp41  .  Manuscript  in  preparation. 

Daniel  P.  Kestler,  Lee  A.  Henderson  and  John  Noti.  Construction  and  expression  of  recombinant 
HIV-1  gp41  constructs  in  prokaryotes.  Manuscript  submitted. 

Transcriptional  Regulation  of  the  Leukocyte  bitegrins. 

A  key  factor  in  understanding  the  mechanisms  of  cell-cell  and  cell-matrix  interactions  in 
the  immune  system  has  been  the  discovery  of  leukocyte-specific  cell  surface  receptors  involved 
in  these  interactions.  These  receptors,  referred  to  as  the  leukocyte  integrins,  consist  of  three 
molecules,  LFA-1,  Mac-1,  and  pl50-95.  Elucidation  of  the  molecular  mechanisms  that  govern 
the  expression  of  the  leukocyte  integrins  will  facilitate  studies  of  adhesive  interactions  in  the 
immune  system.  We  have,  therefore,  begun  studies  designed  to  elucidate  the  transcriptional 
regulation  of  these  molecules.  The  genomic  sequences  for  the  plSO-95  and  LFA-1  a  subunits 
were  isolated  in  this  laboratory  and  we  have  extensively  characterized  the  plS0-9S  sequence 
(aqipraded  manuscript).  We  have  begun  to  identify  regulatory  elements  that  essential  for  this 
gene's  expression  in  the  pre-macrophage  cell  lines,  U937  and  HL60,  and  are  expanding  our 
efforts  to  include  similar  studies  of  the  LFA-1  a  subunit.  Currently,  using  deletion  mutagenesis 
and  DNase  I  footprinting  analysis,  we  have  identified  both  silencer  and  enhancer  elements  that 
regulate  plS0-9S  gene  expression.  Efforts  to  identify  the  binding  proteins  that  interact  with  these 
elements  are  underway. 

Isolation  of  Genes  Associated  with  Breast  Tumor  Metastasis. 

The  major  cause  of  death  in  most  malignant  diseases  is  not  the  primary  tumor,  but  rather 
the  metastatic  spread  of  tumor  cells  to  distant  sites  within  the  body.  Metastasis  is  a  complex 
multistq)  process  that  is  undoubtably  regulated  by  many  different  genes  including  the  integrin 
genes.  We  are  using  the  subtractive  hybridization  technique  to  select  for  genes  that  are 
differentially  expressed  in  normal  and  metastatic  breast  cells.  As  sources  of  metastatic  and  non¬ 
metastatic  cells  we  are  using  established  breast  cell  lines  and  surgically-isolated  tissue. 

Relevant  publications. 

Noti,  J.D.,  Gordon,  M.,  and  Hall,  R.E.  (1992).  Human  plS0-9S  a  subunit:  Genomic 
organization  and  analysis  of  the  5’  flanldng  region.  DNA  and  Cell  Biol.  H,  123-138. 

Noti,  J.D.  and  Gordon,  M.  Intron  1  of  the  human  pl50-95  a  subunit  gene  contains  enhancer  and 
PMA-responsive  elements  essential  for  transcription.  Manuscript  in  preparation. 
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Specific  Aim  3:  Equip  and  Provide  Start-up  funding  of  the  Clinical  Research  Immunodiaenostic 
T^hofatarv  for  the  Study  of  Immunologic  Diseases.  Including  HIV-infected  Individuals. 

As  described  in  Specific  Aim  1,  approximately  400  square  feet  of  space  on  the  first  floor 
of  the  research  building  was  renovated  to  house  a  new  Clinical  Research  Immunodiagnostic 
Laboratory  (CRIL).  The  goal  of  this  facility  was  to  perform  detailed  immunologic  evaluations 
of  patients  with  immunologic  diseases.  Such  diseases  were  intended  to  include  both  hereditary 
and  acquired  immunologic  diseases.  Such  diseases  were  intended  to  include  both  hereditary  and 
acquired  immunodeficiency  diseases  (HIV  and  non-HIV  immunodeficiencies)  as  well  as  allergic, 
rheumatologic,  and  malignant  diseases.  Although  a  stated  goal  of  this  project  was  to  provide 
new  information  relevant  to  AIDS  and  HIV  infection,  our  stated  rationale  was  that  information 
relevant  to  HIV  could  be  gained  by  studying  other  human  immunodeficiency  states.  The  CRIL 
was  also  intended  to  provide  "spin-off  ideas  which  might  generate  important  new  basic  research 
projects  relevant  to  human  immunologic  disease  states.  Dr.  Hall  served  as  Supervisor  of  the 
CRIL  until  10/91,  at  which  time  Dr.  Henderson  became  CRIL  Supervisor  and  Principal 
Investigator  of  this  project  (see  above). 

In  collaboration  between  Dr.  Hall  (Scientific  Director  and  Principal  Investigator  until 
10/91),  other  Guthrie  Scientists,  and  Dr.  Joseph  Foiled  (Chief,  Section  of  Allergy  and  Clinical 
Immunology  at  the  Guthrie  Clinic)  the  following  laboratory  evaluations  were  set  up  and  were 
made  fieely  available  to  Guthrie  Clinic  physicians  for  immunologic  evaluation  of  patients.  These 
tests  were  considered  a  clinical  research  tool  and  not  billed  to  the  patient  or  insurance  company. 
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CLINICAL  RESEARCH/IMMUNODIAGNOSTIC  LABORATORY  (CRIL) 
Laboratory  Evaluations,  June  1,  1990 


1.  Flow  Cytometry 

a.  Surface  Antigen  Phenotype  of  Acute  and  Chronic  Leukemias  (Peripheral  Blood  and  Bone 
Marrow) 

b.  T  and  B  Cell  Subsets  (Non-Malignant)  For  Immunodeficiency  Workup  and  Immune 
Evaluation 

c.  Cell  cycle/Ploidy  Analysis  of  Select  Neoplasms  (e.g.,  Endoscopic  Biopsies  of  Colon) 

2.  Lymphocyte  Transformation/  Proliferation 

a.  To  Mitogens  (Phytohemagglutinin,  Con  A,  Pokeweek  Mitogen) 

b.  To  Antigens  (PPD,  Candida,  Mumps) 

c.  To  Drugs  (Evaluation  of  Drug  Hypersensitivity) 

3.  Neutrophil  Function  Studies  (Immunodeficiency  Workup) 

a.  Nitroblue  Tetrazolium  (NBY)  Reduction 

b.  Chemotaxis 

c.  Chemiluminescence 

4.  Cytotoxicity  Assays 

a.  Cytotoxic  T  Lymphocytes  (Cell  Mediated  Lympholysis) 

b.  Natural  Killer  Cell  Assay  To  Tumor  Cells 

5.  Select  Complement  Components  (e.g..  Cl  inhibitor) 

6.  Allergy  Studies 

a.  Total  Serum  IgE  (Radioimmunoassay) 

b.  '  Allergen-Specific  IgE  By  RAST  (Radioallergosorbent  Test) 

7.  Lymphocytotoxic  Antibody  Assay 
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With  recruitment  of  Dr.  Henderson  and  other  HIV  scientists,  the  following  HIV  in  vitro 
evaluation  were  added  to  the  available  list: 

1.  Western  blot  analysis  for  HIV  antibody  specificity. 

2.  Viral  titres  in  sera  and  titre  produced  by  PBMC  following  mitogen  activation. 

3.  Soluble  CD4  and  AZT  sensitivity  of  clinical  viral  isolates. 

(g)  CQNCLUSIOHS 

In  conclusion,  all  of  the  goals  of  the  original  project  proposal  have  been  accomplished 
within  the  project  period.  Although  this  project  was  intended  to  be  largely  developmental, 
nevertheless  modest  basic  research  progress  has  been  accomplished  in  this  short  time  frame 
(considering  that  new  investigators  were  not  "on  board"  at  the  start  of  the  project,  but  were 
phased  in  over  the  three  years).  More  importantly,  careers  of  promising  young  investigators 
have  been  launched,  and  this  is  expected  to  create  important  scientific  dividends  for  many  years 
to  come.  The  Guthrie  Research  Institute  has  maintained  a  sincere  desire  to  collaborate  directly 
with  Army  Scientists  in  order  to  directly  assist  in  Army  research  projects,  and  it  is  hoped  that 
these  collaborations  as  well  as  with  other  HIV  research  laboratories  will  develop  in  the  future. 
In  addition,  present  and  future  research  at  the  Guthrie  Research  Institute  should  add  to  the 
scientific  base  available  to  Army  scientists  in  order  to  better  diagnose  and  treat  AIDS  and  other 
diseases  in  which  the  immune  system  plays  an  important  role.  Finally,  creation  of  the  CRIL 
should  facilitate  communication  between  Guthrie  Research  Institute  laboratories  and  the  Guthrie 
Clinic,  which  will  facilitate  identification  of  new  insights  into  AIDS  and  other  immunologic 
disease  as  well  as  help  communicate  basic  scientific  finding  to  clinicians. 
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Human  pi 50,95  a-Subunit:  Genomic  Organization  and 
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ABSTRACT 

LFA-1,  Mac-1,  and  plS0,95  comprise  a  family  of  cell-surface  glycoproteins  that  mediate  adhesive  interac¬ 
tions  of  myeloid  and  lymphoid  cells.  These  glycoproteins  are  heterodimers  composed  of  a  common 
i3-subunit  and  distinct  a-subunits.  The  chromosomal  gene  for  Ihe  a-subunit  of  pl50,95  was  isolated  to  pro¬ 
vide  a  framework  from  which  to  study  the  mechanisms  for  expression  of  the  gene.  The  gene  spans  30  kb  of 
DNA  and  contains  31  exons.  In  agreement  with  a  previous  report  by  Corbi  et  al.  (1990),  the  exons  were 
found  to  be  divided  into  five  groups  separated  by  large  introns.  The  extracellular  domains  are  encoded  in 
exons  2  through  30  while  the  transmembrane  and  cytoplasmic  domains  are  encoded  in  exons  30  and  31.  We 
have  expanded  these  findings  in  a  number  of  ways.  The  first  exon  contains  ihe  5'  untranslated  region.  The 
2,163-bp  S'-fianking  sequence  contains  the  first  intron  and  several  putative  transcriptional  initiation  sites 
preceded  by  two  TATA  sequences  and  two  GC-like  boxes.  Additional  sequence  motifs  for  a  variety  of  DNA- 
binding  proteins  are  present  and  discussed.  Fusions  of  the  bacterial  chloramphenicol  acetyltransferase  gene 
(CAT)  to  approximately  S.3  kb  of  S'-flanking  DNA  and  also  to  subcloned  fragments  of  this  region  were  con¬ 
structed  and  transfected  into  the  human  promonocytk  cell  line,  1)937.  CAT  expression  was  inducible  with 
phorbol-12«myristrate-13-acetate  (PMA)  and  full  expression  was  dependent  on  the  presence  of  intron  1  and 
sequences  upstream  from  the  2,163-bp  flanking  DNA.  Additionally,  intron  1  and  a  region  further  upstream 
contain  functional  cis-acting  sequences. 


INTRODUCTION 

The  inteciun  faioly  of  cell-surface  receptors  is  exten¬ 
sively  involved  in  cell-cell  and  cell-matrix  interactions 
of  the  immune  system.  These  receptors  are  glycoproteins 
that  are  composed  of  noncovalently  associated  a-  and  /3- 
subunits  (Hynes,  1987;  Williams  and  Barclay,  1988;  Kishi- 
moto  et  al.,  1989;  Tedder  et  al.,  1989;  Larson  and  Sprin¬ 
ger,  1990).  The  integrin  family  is  divided  into  three  sub¬ 
families  that  have  unique  j3-subunits  but  distinct  a-sub¬ 
units.  The  01  subfamily  contains  three  receptors,  LFA-l, 
Mac-1,  and  pi 50,95  which  are  present  on  leukocytes  and 
mediate  a  broad  spectrum  of  adhesive-dependent  func¬ 
tions.  A  heritable  deficiency,  leukocyte  adhesion  defi¬ 
ciency  (LAD)  of  all  three  receptors  from  the  leukocytes  of 
affected  individuals  is  characterized  by  severe,  recurrent 
bacterial  infections  that  arc  life-threatening  (Fischer  et  al., 
1988).  The  genetic  deficiency  is  due  to  mutations  in  the 
common  |3-subunit. 


In  addition  to  the  involvement  of  these  receptors  in  in¬ 
flammatory  phenomena,  these  molecules  also  are  involved 
in  the  killing  of  tumor  cells  by  cytotoxic  T  cells  (CTLs) 
(Davignon  et  al.,  1981),  natural  killer  (NK)  cells,  and  lym- 
phokine-activated  killer  cells  (LAK)  (Schmidt  et  al.,  1985); 
Martz,  1987).  Recent  evidence  also  suggests  a  role  for  these 
receptors  in  immunosurveillance.  Some  lymphomas  includ¬ 
ing  the  majority  of  Burkitt’s  lymphomas  do  not  express 
LFA-1  on  their  cell  surface  (Clayberger  et  al.,  1987; 
Rousset  et  al.,  1989;  Stauder  et  al.,  1989).  These  observa¬ 
tions  imply  that  a  decreased  expression  of  one  or  more  ad¬ 
hesion  molecules  may  be  a  means  by  which  some  tumors 
escape  immunosurveillance. 

The  distribution  of  these  molecules  differs  with  the  par¬ 
ticular  cell  type.  LFA-1  (Kurzinger  et  al. ,  1981)  is  found  on 
virtually  all  leukocytes  whereas  Mac-1  and  pi 50,95  are 
found  primarily  on  monocytes,  macrophages,  and  gran¬ 
ulocytes.  (Springer  et  al.,  1979,  1986).  The  first  appearance 
of  the  leukocyte  integrins  occurs  as  stem  cells  begin  to  dif- 
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ferentiate.  LFA-1  is  expressed  on  pre-B  and  late  myelo¬ 
blasts,  whereas,  Mac-1  and  plS0,9S  are  detected  as  mono¬ 
cyte  and  granulocyte  differentiation  increases  and  is  con¬ 
sistent  with  the  detection  of  increased  expression  of  the 
0-subunit  when  HL-60  promyelocytic  leukemia  cells  differ¬ 
entiate  in  vitro  along  the  granulocytic  pathway  (Miller  et 
ai,  1986).  LFA-1  and  pl50,95  are  also  found  on  CTLs  and 
NK  cells,  although  higher  amounts  of  LFA-1  are  expressed 
on  both  cell  types  (Lanier  et  al.,  198S).  plS0,9S  is  also 
highly  expressed  on  hairy  leukemia  cells  and,  as  such,  has 
been  used  as  an  immunodiagnostic  marker  for  this  disease 
(Schwarting  et  al.,  198S).  Recently,  this  receptor  has  been 
found  on  some  B-cell  chronic  lymphocytic  leukemias, 
some  large-cell  lymphomas  of  uncertain  origin,  and  also, 
on  some  neoplastic  T-cell  (Chadburn  et  al.,  1990). 

The  amino  acid  sequences  of  all  three  a-subunits  and  the 
common  0-subunit  have  previously  been  determined 
(Corbi  et  al.,  1987;  Law  et  al.,  1987;  Arnout  et  al.,  1988; 
Larson  et  al. ,  1989).  To  understand  the  molecular  basis  for 
the  expression  and  function  of  the  leukocyte  integrins,  we 
(Noti  et  al.,  1990)  and  others  (Corbi  et  al.,  1990)  have 
begun  to  isolate  the  genomic  sequences  for  these  molecules 
and  preliminarily  characterize  these  genes.  In  these  find¬ 
ings  we  report  the  isolation  of  a  cosmid  clone  that  contains 
the  complete  transcriptional  unit  of  the  plS0,9S  a-subunit 
gene.  We  have  determined  the  exon-intron  organization  of 
this  gene  and  have  localized  the  transcriptional  start  site(s). 
DNA  sequence  and  functional  analysis  were  done  to  iden¬ 
tify  transcriptionally  important  regions  in  the  S'-flanking 
DNA.  It  is  anticipated  that  this  work  will  facilitate  studies 
concerning  the  regulation  of  the  plS0,9S  a-subunit  at  the 
transcriptional  level. 

MATERIALS  AND  METHODS 

Construction  and  screening  of  the  genomic  library 

A  human  genomic  library  was  constructed  in  the  cosmid 
pWE15  (Wahl  et  al.,  1987)  from  DNA  isolated  from  hu¬ 
man  peripheral  blood  leukocytes.  A  total  of  2  x  10*  re¬ 
combinant  clones  were  screened  by  colony  hybridization 
with  a  partial  cDNA  clone  for  the  pi  30,93  er-subunit  iso¬ 
lated  from  an  HL60  cDNA  library. 

DNA  sequence  analysis 

Double-stranded  DNA  was  sequenced  by  the  dideoxy 
chain  termination  method  with  the  T?  DNA  polymerase  se- 
quenase  according  to  the  manufacturer’s  (U.S.  Biochemi- 
cals,  Geveland,  OH)  instructions.  DNA  fragments  were 
subcloned  into  the  plasmid  pBlueskript  SK*  (Short  et  a!., 
1988)  prior  to  sequencing.  Cosmid  DNA  (10  /tg)  was  also 
sequenced  directly. 

SI  nuclease  and  primer  extension  analysis 

A  modified  method  of  Berk  and  Sharp  (Berk  and  Sharp, 
1977)  was  used  for  SI  nuclease  analysis.  Single-stranded 
probes  were  prepared  from  appropriate  DNA  fragments 
subcloned  into  the  plasmid  ml3mpl8.  PoIy(A)*RNA 


(10-23  fig)  isolated  from  the  human  promonocytic  cell  line 
U937  was  annealed  to  the  probe  in  20  /il  of  SO®'*  forma- 
mide/0.4  M  NaCl/50  mM  PIPES  pH  6.4/1  mM  EDTA  for 
18  hr  at  either  43°C  or  30®C.  Then,  300  fJ  of  an  SI  cock¬ 
tail  buffer  (230  mM  NaCI/50  mM  NaOAc  pH  4.5/4.5  mM 
ZnS04/3  fig  salmon  sperm  DN A/400  units  SI  nuclease) 
was  added  for  60  min  at  37®C.  The  reaction  products  were 
analyzed  on  a  polyacrylamide  sequencing  gel. 

For  primer  extension  analysis,  10  fig  of  poly(A)*RNA 
was  annealed  with  3-4  ng  of  the  appropriate  end-labeled 
oligonucleotide.  Annealing  was  done  in  10  fil  of  0.4  M 
NaCl/40  mAf  PIPES  pH  6.4/5  mM  EDTA  for  16  hr  at 
30“C.  Ninety  microliters  of  a  cocktail  solution  (30  mAf 
Tris-HCl  pH  8.3/10  mM  MgCl,/10  mAf  dithiothreitol/0.5 
mAf  each  dNTP/60  units  placental  RNase  inhibitor/60  of 
ng  actinomycin  D/55  units  AMV  reverse  transcriptase)  was 
then  added  followed  by  incubation  for  90  min  at  42°C. 
The  products  were  analyzed  on  a  polyacrylamide  sequenc¬ 
ing  gel. 

Construction  of  CA  T  plasmids 

The  plasmid  pB4P3-CAT  was  constructed  by  first  sub¬ 
cloning  the  746-bp  Bam  Hl-Pst  I  genomic  fragment  (corre¬ 
sponding  to  the  region  -1,020  to  -1,766  from  Fig.  2) 
from  clone  8B  into  the  polylinker  of  pBluescript  SK*.  The 
resulting  plasmid  was  then  digested  with  Hind  III  and  Xba 
I  (restriction  sites  that  flank  the  Bam  Hl-ftr  I  fragment) 
and  the  fragment  containing  the  746-bp  Bam  Hl-Pst  I 
fragment  was  then  isolated  and  ligated  into  the  Hind  III- 
Xba  I  sites  upstream  from  the  CAT  gene  in  the  promoter¬ 
less  CAT  reporter  plasmid,  pCAT-Basic  (Promega,  Madi¬ 
son,  WI).  The  plasmid  pB4.1-CAT  was  constructed  by 
first  subcloning  the  4.4-kb  Bam  HI  genomic  fragment  (see 
Fig.  2)  from  clone  8B  into  pBluescript  SK*.  The  resulting 
plasmid  was  then  digested  with  Hind  III  and  Xba  I  and  the 
fragment  containing  the  4.4-kb  Bam  HI  fragment  was  then 
ligated  into  Hind  \\\-Xba  1-digested  pCAT-Basic.  The 
plasmid  pB4,3A-CAT  was  constructed  by  first  subcloning 
a  1.1-kb  Bam  Hl-Prr  I  genomic  fragment  containing  in- 
tron  1  and  the  ATG  start  codon  for  the  pl50,95  a-subunit 
from  clone  8B  into  pBluescript  SK*.  The  resulting  plasmid 
was  then  digested  with  Xba  I  (located  12  bp  from  the  Bam 
HI  site)  and  Sac  I  (located  32-bp  upstream  from  the  ATG 
codon  for  the  pl50,95  a-subunit  gene.  Fig.  2).  The  Sac  I 
terminus  of  this  fragment  was  ligated  to  a  Sac  l-Xba  I 
adaptor  prepared  by  annealing  the  two  oligonucleotides, 
3'-GTCGACGTAGCCGTTTGGTACCCCTGATAG-3' 
and  3'-CTAGCTATCAGGGGTACCAAACGGCTACGT 
CGACAGCT-3'.  Four  translational  stop  codons  (under¬ 
lined)  in  two  different  reading  frames  were  introduced  into 
this  adaptor.  The  resulting  1 ,038-bp  Xba  1  fragment  (con¬ 
taining  intron  1  but  lacking  the  ATG  codon)  was  ligated 
into  thp  Spe  1  site  located  49  bp  upstream  of  the  the  ATG 
codon  for  the  CAT  gene  in  pB4. 1-CAT.  The  plasmid 
pB4,3B-CAT  was  constructed  by  ligating  the  intron  1 -con¬ 
taining  Xba  I  fragment  in  the  opposite  orientation  into  the 
Spe  I  site  of  pB4. 1-CAT.  DNA  sequence  analysis  con¬ 
firmed  that  the  various  fragments  were  ligated  correctly 
into  each  parent  plasmid.  The  plasmids  pSVCat-I  and 
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pSVCAT-2  were  constructed  by  ligating  the  1.1 -kb  Bam 
Hl-Psr  I  fragment  containing  intron  I  and  the  4.4-kb  Bam 
HI  fragment  from  clone  8B,  respectively,  into  the  enhan¬ 
cerless  SV40  promoter  plasmid,  pCAT-Promoter  (Pro- 
mega,  Madison,  WI).  The  plasmid  CMV-CAT  was  con¬ 
structed  by  ligating  the  0.7-kb  Sau  3A  fragment  containing 
the  CAT  gene  from  the  pCAT-Basic  plasmid  into  the  Bam 
HI  site  of  pcDNAI  (Invitrogen  Corp.,  San  Diego,  CA). 
The  plasmid  CMV-CAT  contains  the  cytomegalovirus  pro¬ 
moter/enhancer  DNA  fused  to  the  CAT  gene. 

Cell  culture 

The  cell  line  U937  was  grown  in  RPMI-1640  medium 
supplemented  with  lO^o  fetal  calf  serum,  1(X)  U  of  penicil¬ 
lin  per  ml,  and  1(X)  ^g  of  streptomycin  per  ml.  HeLa  cells 
were  maintained  in  Dulbecco’s  modified  Eagle’s  medium 
(DMEM)  supplemented  with  10%  fetal  calf  serum  and 
penicillin-streptomycin. 

Transfections  and  CA  T  assays 

HeLa  cells  were  transfected  by  the  DEAE-dextran 
method  (Gorman  et  al.,  1982)  with  modifications.  HeLa 
cells  (2  X  10*)  were  transfected  with  30  ^g  of  plasmid 
DNA  in  4  ml  of  DMEM  supplemented  with  10%  NuSerum 
(Collaborative  Research,  Waltham,  MA)  and  100  m8  of 
DEAE-dextran.  After  4  hr  at  37'’C,  the  medium  was  then 
removed  and  4  ml  of  DMEM  supplemented  with  10%  Nu¬ 
Serum  and  10%  DMSO  was  added  for  2  min  at  room  tem¬ 
perature.  The  cells  were  washed  and  IS  ml  of  tissue  culture 
medium  was  then  added.  PMA  was  added  at  (2  ng/ml)  24 
hr  later.  The  cells  were  harvested  72  hr  after  transfection. 

Electroporation  was  used  to  transfect  U937  cells.  The 
cells  were  washed  once  with  warm  PBS  and  2  x  10’  cells 
were  resuspended  in  0.2  ml  of  RPMI-1640  supplemented 
with  10  toM  dextrose,  0. 1  mM  dithiothreitol  (DTT).  After 
adding  50  fig  of  plasmid  DNA  in  20  fil  of  RPMI-1640/dex- 
trose/DTT,  the  cells  were  then  subjected  to  a  single  electric 
pulse  (capacitance,  960  fiF;  voltage,  2(X)  V;  duration,  28-30 
msec)  in  a  BioRad  gene  pulsar  apparatus.  The  ceUs  were 
transferred  to  a  lOO-mm  tissue  culture  dish  containing  15 
ml  of  tissue  culture  medium.  PMA  (2  ng/ml)  was  added  24 
hr  later.  The  cells  were  harvested  72  hr  after  electropora¬ 
tion. 

The  cells  were  harvested,  resuspended  in  100  fil  of  0.25 
M  Tris-HCl  pH  8.0  and  lysed  by  freezing  and  thawing. 
After  normalization  for  protein  content,  CAT  activity  was 
determined  with  a  conunercially  available  kit  (Promega, 
Madison,  Wl).  The  reaction  products  were  separated  by 
thin-layer  silica  gel  chromatography.  The  plates  were  auto- 
radiographed,  and  the  acetylated  products  were  cut  from 
the  plate  and  counted  in  a  scintillation  counter.  The  CAT 
assays  were  performed  in  duplicate  and  repeated  three  to 
Hve  times  to  ensure  reproducibility. 

Polymerase  chain  reaction  analysis 

The  polymerase  chain  reaction  (PCR)  was  done  directly 
on  poly(A)*RNA  isolated  from  U937  cells  transfected  with 


pB4,3A  CAT  or  pB4,3B  CAT  as  described  (Brenner  et  al., 
1989).  First-strand  cDNA  was  synthesizsed  from  50-100  ng 
of  poly(A)*RNA  in  a  20-^^  volume  with  oligo(dT).  Five  mi¬ 
croliters  of  the  first-strand  cDNA  was  amplified  by  PCR 
for  35  cycles  with  sense  primers  derived  from  the  -1,627 
to  - 1,059  region  of  the  pi  50,95  y-subunit  gene  (sec  Fig.  2) 
in  combination  with  an  antisense  primer  to  the  5'  end  of 
the  CAT  gene.  The  PCR  products  were  analyzed  on  a  4% 
NuSieve  agarose  gel  (FMC  Corp.,  Rockland,  ME). 

RESULTS 

Isolation  of  the  genomic  sequences  for  the 
pi 50,95  a-subunit 

Five  cosmid  clones  were  isolated  and  analyzed  by  com¬ 
parative  restriction  enzyme  analysis  and  by  hybridization 
with  oligonucleotides  specific  for  the  amino-terminal,  car- 
boxy-terminal,  and  central  portions  of  the  pl50,95  cDNA 
sequence.  One  cosmid  clone,  designated  8B,  contained  a 
35-kb  insert  that  hybridized  with  these  oligonucleotide 
probes.  Total  genomic  DNA  digested  with  several  restric¬ 
tions  enzymes  and  hybridized  with  clone  8B  DNA  as  probe 
revealed  a  restriction  enzyme  pattern  identical  to  the  pat¬ 
tern  of  DNA  fragments  of  the  inserted  DNA  in  this  clone 
(data  not  shown).  This  indicated  that  the  genomic  DNA  in 
clone  8B  was  from  a  contiguous  region  of  chromosomal 
DNA.  The  restriction  enzyme  map  of  the  region  is  shown 
(Fig.  1). 

Exon-intron  organization 

The  exon-intron  organization  was  determined  by  DNA 
sequence  analysis.  An  exon-intron  boundary  was  indicated 
when  the  sequence  from  the  genomic  clone  diverged  from 
that  of  the  cDNA  (Corbi  el  al.,  1987).  With  this  strategy 
more  than  15  kb  of  DNA  was  sequenced.  The  DNA  se¬ 
quences  of  all  the  exon-intron  junctions  contained  the  in¬ 
variant  gt  and  ag  intronic  dinucleotides  (Table  I).  With  the 
exceptions  of  exons  I  and  31  (detailed  below),  the  wx  of 
each  exon  was  less  than  210  nucleotides.  The  exons  encod¬ 
ing  the  protein  sequence  for  the  o-subunit  of  pi 50,95 
(exons  2-31)  span  approximately,  30  kb  of  DNA  (Fig.  1) 
and  are  clustered  in  five  groups  that  correlate  with  previ¬ 
ously  identified  domains  of  thu  protein  (Corbi  et  al. ,  1987; 
see  Discussion). 

Identification  of  an  intron  in  the  5' -flanking  region 

The  2,163-nucleotide  sequence  upstream  of  the  ATG 
translational  start  site  was  determined  and  shown  in  Fig.  2. 
Two  putative  3'  splice  acceptor  consensus  sequences  are 
present  52  nucleotides  and  65  nucleotides  upstream  from 
the  ATG  codon.  To  determine  whether  these  are  func¬ 
tional  splice  sites,  SI  nuclease  and  primer  extension  analy¬ 
sis  was  performed.  For  SI  analysis,  a  177-nucleotide  single- 
stranded  probe  was  prepared  that  spanned  these  two  po¬ 
tential  splice  sites.  This  probe  was  then  hybridized  with 
poly(A)’RNA  from  PMA-stimulated  U937  cells  followed 
by  SI  nuclease  digestion.  Two  major  and  one  minor  pro¬ 
tected  fragments  were  detected.  The  size  of  these  protected 
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FIG.  1.  The  intron-exon  organization  of  the  plS0,9S  a-subunit  gene.  The  exons  are  in  solid  black  boxes  and  are  num¬ 
bered.  The  S'  untranslated  region  is  shown  in  open  boxes  and  is  separated  by  intron  1.  The  3'  untranslated  region  is 
shown  as  a  hatched  box.  A  partial  restriction  enzyme  map  is  shown:  E,  Eco  RI;  K,  Kpn  I;  H,  Hind  III;  S,  Sal  I;  B,  Bam 
HI;  P,  Pst  1;  X,  Xba  1.  The  exons  are  clustered  into  five  groups  as  indicated.  SP,  Signal  peptide;  TM,  transmembrane 
domain;  CD,  cytoplasmic  domain. 


fragments  (Fig.  3)  correlated  with  the  position  of  the  3' 
consensus  splice  sequences,  thus,  indicating  the  presence  of 
an  intron  in  the  S'-flanking  region.  This  was  confirmed  by 
primer  extension  analysis  with  the  N-6  oligonucleotide 
(Fig.  3).  Extension  of  the  N-6  oligonucleotide  on 
poly(A)*RNA  from  PMA-stimulated  U937  cells  revealed 
several  extension  products  that  were  considerably  longer 
than  the  SI  nuclease  protected  fragments.  The  longest  pri¬ 
mer-extension  product,  360  nucleotides,  indicated  that  the 
mature  mRNA  for  the  p 150,95  a-subunit  extends  at  least 
this  far  upstream  of  the  ATG  translational  start  codon. 

The  5'-flanking  sequence  was  scanned  for  the  presence 
of  the  5'  splice  donor  site.  A  consensus  splice  sequence  is 
present  650  nucleotides  upstream  of  the  ATG  start  codon. 
To  determine  if  this  is  a  functional  splice  site,  primer  ex¬ 
tension  analysis  was  done  with  primers  tha  flanked  this 
site.  As  shown  in  Fig.  4,  no  distinct  extension  products 
were  seen  when  oligonucleotides  N-34  and  N-35  Oocated  3' 
of  this  site)  were  used  as  primers.  However,  extension 
products  254  and  301  nucleotides  long  resulted  when  oligo¬ 
nucleotide  N-8  (located  5’  of  this  site)  was  used  as  the  pri¬ 
mer.  This  indicated  that  the  consensus  5'  splice  sequence 
located  650  nucleotides  upstream  of  the  ATG  start  codon 
is  functional. 

Determination  of  the  site  of  transcriptional  initiation 

Initial  primer  extension  analysis  with  the  N-6  oligonu¬ 
cleotide  resulted  in  two  major  extension  products  270  and 
360  nucleotides  long  (Fig.  3).  Since  the  potential  for  gener¬ 
ating  prematurely  terminated  extension  products  increases 
with  the  length  of  the  mRNA  being  transcribed,  analysis 
was  performed  with  primers  that  hybridized  with  se¬ 
quences  upstream  of  intron  1.  The  length  of  each  primer 
extension  product  was  obtained  by  comparison  with  the  se¬ 
quencing  ladders  in  the  adjacent  lanes.  Two  extension 
products,  214  and  408  nucleotides  long  (corresponding  to 
-1,178  and  -1,371,  respectively),  were  detected  when  pri¬ 
mer  N-9,  the  primer  closest  to  the  5'  splice  acceptor  site  of 
intron  1,  was  extended  (Fig.  5).  Primer  N-24,  which  is  lo¬ 
cated  60  nucleotide  5'  to  primer  N-9,  was  extended  to  55 


nucleotides  in  length  (corresponding  to  -1  080)  (Fig.  5). 
The  heterogeneity  of  the  sizes  of  the  extended  products 
may  result  from  multiple  transcriptional  starts,  or  might  be 
the  result  of  an  inability  of  reverse  transcriptase  to  extend 
products  beyond  a  secondary  mRNA  structure.  The  small¬ 
est  extension  products  of  the  N-9  and  N-24  primer  are  pre¬ 
ceded  by  a  29-nucleotide  A-t-T-rich  sequence  (-1,108  to 
-1,080)  that  potentially  can  form  a  stem-loop  structure 
(Fig.  2).  Conceivably,  however,  this  A+T-rich  sequence 
may  serve  as  an  alternative  initiation  site  (Broders  et  at., 
1990).  The  results  of  extension  reactions  be^nning  5'  to 
this  A-^T-rich  region  are  shown  in  Fig.  5.  Two  major  ex¬ 
tension  products,  81  and  189  nucleotides  in  length  (corre¬ 
sponding  to  -1,200  and  -1,371,  respectively),  were  de¬ 
tected  when  N-22  was  used  as  the  primer. 

The  upstream  start  sites  at  - 1 ,308  and  - 1 ,200  are  pre¬ 
ceded  by  a  TATA  sequence  (Matsui  el  ai,  1980)  at 
-1,331.  The  longest  extension  product  at  -1,371  is  up¬ 
stream  of  this  TATA  sequence  but  is  also  preceded  by  a 
TATA-like  sequence  begining  at  -1,531  (TAAAAAT). 
Two  sequences  that  share  homology  with  the  Spl  consen¬ 
sus  sequence  are  found  within  the  region  of  the  TATA  se¬ 
quences  (Dynan  and  Tjian,  1983)  (Fig.  2). 

Analysis  of  the  5'-flanking  sequence 

The  5'-flanking  sequence  was  analyzed  (Ghosh,  1990) 
for  other  sequences  that  could  potentially  bind  transcrip¬ 
tional  regulatory  proteins  (Table  2).  The  sequence, 
GAACAG,  at  position  -2,108  is  homologous  to  a  se¬ 
quence  motif  in  the  glucocorticoid  response  element  (GRE) 
in  the  Moloney  murine  leukemia  virus  (Mo^MuLV)  en¬ 
hancer  (speck  and  Baltimore,  1987).  The  sequence, 
CCACCA,  present  in  the  region  upstream  of  the  immuno¬ 
globulin  heavy-chain  enhancer  (Augereau  and  Chambon, 
1986)  is  also  found  upstream  (positions  -2,033  and 
-1,729)  of  the  putative  TATA  boxes.  At  position  -1,925 
is  a  sequence  identical  to  the  GTI  motif  in  the  SV-40  en¬ 
hancer  (Siao  et  a!.,  1987).  The  sequence  GGCCAA  is 
found  at  position  -1,810  and  is  homologous  to  a  low-af¬ 
finity  CTF/nuclear  factor  I  binding  site  (Garcia  et  at.. 
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Table  I. 

Sequence  at  exon-iniron  junction 
Exon  Exon  Intron  - 


no. 

size  (bp) 

size  (bp) 

S'  Splice  donor 

3'  Splice  acceptor 

1 

714/777 

594/607 

CGG  gCgaggcaccc  .... 

2 

102 

500  i50 

ACA  G  gtgagcctgg  . . . 

. . tcttccccag  CC  TTA 

3 

106 

650  *50 

TCC  TG  gtgaggccca.  . . 

. . . ctcctcgcag  G  CTG 

4 

104 

300  140 

CAG  G  gtgagtcacc  ... 

..tgtcccacag  TG  CCC 

5 

71 

161 

CTG  gtgagttgcc . 

6 

112 

2000  1200 

CAG  G  gtgagtgtct. ... . 

..ccttccccag  AG  TGC 

7 

IJI 

85 

CAG  gtgtgctttg . 

8 

146 

200  150 

GTC  GT  gtccgtccCg. . . 

. . . ttcctgatgg  G  CAC 

9 

154 

500  1200 

GGG  gtaggcctgg . 

10 

151 

500  1200 

GAG  G  gtgagtctga. . . . 

..cttacccaag  GT  ACG 

11 

74 

149 

CCT  gtgcgtgggg . 

12 

130 

300  1150 

CTG  G  gcgagaaaca. . . . 

..ggcccctcag  GT  TAC 

13 

143 

180  120 

CAG  gttgggcgtg . 

14 

141 

88 

GGG  gtgagttgct . 

15 

210 

6500  1200 

CAG  gtgaggccgt  . 

16 

131 

131 

CTC  AG  gtgagagcag. . . 

..ctgcccgcag  GA  CCA 

17 

163 

100  140 

AGC  C  gcgagccccc. . . . 

..ccgcccacag  GT  GAC 

18 

155 

500  ilOO 

CCG  gtgcgtctgg . 

. cgtcccccag  AGC 

19 

132 

100  130 

TCC  gtgagtcctg . 

20 

74 

500  1150 

CCA  GG  gtgagcgccc. . . 

..tgacacccag  CT  TGA 

21 

142 

3500  1200 

CAG  gtgcaccctc . 

22 

117 

84 

CAG  gtcagcctgg . 

23 

80 

99 

AGC  AG  gtgagccggg. . . 

. . . .ctttctccag  T  GAG 

24 

84 

2100  ilOO 

AGC  AG  gtcagcaggt. . . 

. . . .ccctttccag  C  CAC 

25 

79 

108 

CAG  gtcaggtggtg . 

26 

108 

76 

CAG  gtacctggat . 

27 

84 

200  150 

CTG  gtgaggaggg . 

28 

114 

73 

CAG  gcgtgtgggt . 

29 

102 

250  140 

CAG  gtagagacca . 

30 

111 

1000  IlOO 

AAA  gcgagtgtct . 

127 


31 


1247 


12» 
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FIG.  3.  Location  of  intron  1  in  the  S'  untranslated  re¬ 
gion.  A.  Primer  extension  analysis  of  U937  poly(A)*- 
RNA  with  the  N-6  primer  is  shown.  The  size  of  the  longest 
extension  product  (360  nucleotides)  is  shown  next  to  a 
DNA  sequencing  ladder.  B.  SI  nuclease  analysis  of  U937 
poly(A)*RNA  is  shown.  Hybridizations  were  done  at  either 
43®C  of  SO'C  prior  to  the  addition  of  SI  nuclease.  The 
samples  (1  id  and  5  id)  were  loaded  adjacent  to  a  DNA  se¬ 
quence  l^der  obtained  with  the  N-6  primer.  The  major 
protected  products  (88  and  89  nucleotides)  and  a  minor 
protected  product  (71  nucleotides)  are  indicated.  Undi¬ 
gested  probe  DNA  is  shown.  C.  A  schematic  of  the  Si 
nuclease  reaction  is  shown.  ’*P-Labeled  N-6  primer  was  ex¬ 
tended  to  the  Sau  3A  (S)  site  with  Klenow  enzyme  to  gen¬ 
erate  a  177-nucleotide  single-stranded  probe.  The  3'  site  of 
intron  1  is  shown.  U937  Poly(A)*RNA  that  is  not  homolo¬ 
gous  to  the  probe  is  indicated  with  a  dotted  line. 


FIG.  4.  Location  of  the  S'  end  of  intron  1 .  Primer  exten¬ 
sion  analysis  of  U937  poly(A)*RNA  with  the  N-8,  N-34, 
and  N-35  primers  is  shown.  The  N-34  and  N-35  primer-ex¬ 
tension  products  (1-^  and  5-^1  samples)  were  loaded  adja¬ 
cent  to  a  DNA  sequence  ladder.  The  N-8  primer  extension 
products  are  shown  next  to  a  DNA  sequence  ladder.  The 
size  of  each  extension  product  in  nucleotides  is  indicated. 


FIG.  2.  Nucleotide  sequence  of  the  S'-flanking  region.  A.  The  2,163-nucleotide  sequence  upstream  from  the  ATG 
translational  start  site  designated  + 1  is  shown.  The  S'  and  3'  consensus  splice  sites  are  in  brackets.  The  location  of  the 
splice  sites  determined  by  SI  nuclease  analysis  are  indicated  with  closetj  arrowheads.  The  numbers  88  and  71  adjacent  to 
the  closed  arrowheads  refer  to  the  size  of  the  major  and  the  minor  protected  fragments  (see  Fig.  4).  The  location  of  the 
oligonucleotides  used  for  primer  extension  and  SI  nuclease  analysis  are  indicated  with  horizontal  arrows.  The  transcrip¬ 
tional  start  sites  are  indicated  with  open  arrowheads  and  the  primer  used  to  obtain  that  particular  site  is  shown  next  to 
the  arrow.  Putative  TATA  sequences  are  boxed;  the  potential  binding  sequences  for  transcriptional  factors  are  under¬ 
lined;  and  a  potential  hairpin  loop  is  overlined.  B.  A  schematic  of  the  transcriptional  start  sites  in  relation  to  the  puta¬ 
tive  TATA  boxes  is  shown.  The  numbering  corresponds  to  the  DNA  sequence. 
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FIG.  5.  Location  of  the  transcriptional 
start  site(s).  Primer  extension  analysis  of 
U937  poly(A)*RNA  with  the  N-9,  N-24, 
and  N-22  primers  is  shown.  The  location  of 
each  primer  is  shown  in  Fig.  2.  The  N-22 
and  N-24  primer  extension  products  were 
loaded  adjacent  to  DNA  sequence  ladders 
generated  with  the  same  primers,  respec¬ 
tively.  The  N-9  primer  extension  products 
are  shown  next  to  a  DNA  sequence  ladder. 
The  size  of  each  extension  product  in  nu¬ 
cleotides  is  indicated. 


1987) .  The  sequence  located  at  position  -1,23S  resembles 
a  DNase  I  hypersensitive  site  found  in  the  rabbit  utero¬ 
globin  gene  region  (Jantzen  et  ai,  1987).  The  binding  site 
for  the  EBP20  protein  is  found  within  the  SV40  core  C  ele¬ 
ment  (Johnson  et  al.,  1987)  and  is  present  at  position 
-800.  Finally  the  consensus  binding  sequence  for  a  NF- 
xB-like  factor  found  in  the  interleukin-2  receptor  a-chain 
(Leung  and  Nabel,  1988)  is  present  at  position  -90S.  This 
site  is  particularly  interesting  in  that  xB-like  sites  are  asso¬ 
ciated  with  cell-surface  glycoproteins  (Baldwin  and  Sharp, 

1988) . 

Within  the  S91-bp  region  of  intron  1  are  found  16  se¬ 
quence  motifs  that  could  potentially  interact  with  DNA- 
binding  proteins.  Sequence  motifs  for  a  progesterone-re¬ 
sponsive  element  in  the  chicken  lysozyme  gene  (LysHREI) 
(Hecht  et  al. ,  1988),  and  a  tissue-spedfic  element  in  the  hu¬ 
man  Oi-antitrypsin  (a.-AT)  gene  (De  Simone  et  al,  1987) 
are  present  at  positions  -648  and  -S64,  respeaively.  Two 
octamer-binding  sequences  found  in  a  variety  of  genes,  in¬ 
cluding  the  SV40  (Zenke  et  al. ,  1986)  and  immunoglobulin 
(Augereau  and  Chambon,  1986)  enhancers,  are  present  at 
positions  -530  and  -255,  respectively.  Tandem  AATA- 
AAT  sequences  lie  near  the  5'  end  of  the  intron.  This  se¬ 
quence  has  been  reported  to  be  involved  in  mediating  cell- 
specifk  and  thyroid  hormone  stimulation  of  the  rat  growth 
hormone  gene  (GH-CSE2)  (Ye  et  al.,  1988).  A  DNA  motif 
similar  to  the  liver-specific  DNA-binding  protein  EBP20- 
ADEIl  (Costa  et  al.,  1988)  is  present  at  position  -440. 
The  sequence  found  at  position  -408  is  identical  to  the  se¬ 


quence  found  within  the  core  enhancer  for  SV40  (Weiher 
et  al.,  1983).  The  Chinese  hamster  ovary  adenine  phospho- 
ribosyl-transferase  gene  (APRT)  contains  the  sequence 
GCCCCRCC  (Park  and  Taylor,  1988)  which  is  found  at 
position  -272.  Multiple  PEA3  sequence  motifs  (Martin  et 
al.,  1988)  are  located  at  positions  -107,  -75,  and  -51. 
The  sequence  AGATGACT  (Garcia  et  al.,  1987)  at  posi¬ 
tion  -222  shares  homology  with  the  consensus  sequence 
(C/G)TGACT(C/A)A  that  is  recognized  by  the  binding 
protein  AP-1  (Lee  et  al.,  1987).  Additionally,  the  sequence 
GGTGAGCCTGGG  at  position  -281  is  nearly  identical  to 
a  sequence  within  the  TCI  and  TCII  motifs  of  the  SV40 
enhancer  (Zenke  et  al.,  1986).  Both  TCI  and  TCII  can 
bind  the  AP-2  protein  (Mitchell  et  al.,  1987).  The  AP-1 
and  AP-2  sites  are  responsive  to  phorbol  esters  (Angel  et 
al.,  1987;  Chiu  et  al.,  1987)  and,  therefore,  it  will  be  of 
particular  interest  to  investigate  these  sequences  as  the 
pi 50,95  a-subunit  gene  is  upregulated  by  phorbol  esters. 

Expression  of  chloramphenicol  acetyltransferase 
directed  by  the  5' -flanking  region 

The  results  described  above  showed  that  transcription  is 
initiated  at  one  or  more  sites  within  the  region  -1,200  to 
-1,371.  To  determine  if  a  functional  promoter  is  within 
this  region,  the  746-bp  Bam  Hl-ftt  I  fragment  corre¬ 
sponding  to  the  region  -1,020  to  -1,766  (Figs.  2  and  6) 
was  inserted  upstream  of  the  CAT  gene  in  the  promoterless 
and  enhancerless  plasmid  pCAT-Basic.  The  resulting  plas- 
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Table  2.  Location  and  Identification  of  Potentmi  Regclatorv  Elements 


Regulatory  element 

J'— J'  Sequences 

Location 

LVaRS 

GAACAG 

-2,108 

(Speck  and  Baltimore,  1987) 
lgHC3 

CCACCA 

-2,033,  -1,729,  -1,237,  -383 

(Augereau  and  Chambon,  1986) 
GTI-SV40 

GGGTGTGG 

-1,925 

(Siaoem/.,  1987) 

NFI-E3 

GGCAA 

-1,810,  -1,148 

(Garcia  er  <7/. ,  1987) 

PEA3RS 

AGGAAG 

-1,793,  -107,  -75,  -51 

(Martin  et  al.,  1988) 

SPl 

GCCCGG 

-1,598,  -1,595,  -1,501,  -138 

(Dynan  and  Tjian,  1983) 

TATA 

TAAAAAT 

-1,531 

(Matsui  et  al.,  1980 

TATA 

ATATAA 

-1,332 

(Matsui  et  al.,  1980) 
uteroglob.  1 

RYYWSGTG 

-1,235 

(Jantzen  et  al.,  1987) 

Ad2MLP 

TATAAA 

-1,099,  -605 

(Garcia  er  u/. ,  1987) 

NFKB  CS2 

RGGGRMTYYC 

-905 

(Leung  and  Nabel,  1988) 
EP20-SV40 

TTCCACAC 

-800 

(Johnson  er  a/.,  1987) 

LysHREI 

CAGAGAACACAGG 

-648 

(Hechtetal.,  1988) 
alAT-CS 

TGTGGTTT 

-564 

(De  Simone  et  al.,  1987) 
oct-B2-SV4 

CTTTGCAT 

-530 

(Zenke  el  al.,  1986) 

GH-CSE2 

AATAAAT 

-1,098,  -487 

(Yeetal.,  1988) 

EBP20-ADEI1 

TCCTACCC 

-440 

(Costa  era/.,  1988) 

SV40.13 

TGGAAAG 

-408 

(Wciher  ef  al.,  1983) 

SV40.4 

CCCAGGCTCCCC 

-281 

(Zenke  era/.,  1986) 

APRT  cons 

GCCCCRCC 

-272 

(Park  and  Taylor,  1988) 

OCTA3 

ATTTGCAT 

-255 

(Augereau  and  Chambon,  1986) 
AP1-E3.2 

AGATGACT 

-222 

(Garcia  E/ a/.,  1987) 

Previously  identifled  ONA-binding  proteins  (regulatory  elements)  and  Ihe  specific  DNA  sequence  these  proteins 
recognize  are  shown.  The  location  of  homologous  DNA  sequences  in  ihe  S'-flanking  region  is  indicated.  The  num¬ 
bers  correspond  to  the  DNA  sequence  shown  in  Fig.  2. 


mid,  pB4P3-CAT,  was  transfected  into  U937  cells  by  elec*  presence  of  PMA  (Fig.  6),  indicating  that  additional  flank- 
troporation.  It  has  previously  been  shown  that  stimulation  ing  DNA  was  required.  To  determine  if  this  conclusion 
of  U937  with  PMA  for  48  hr  leads  to  a  marked  inaease  in  was  correct,  pB  4.1-CAT,  which  contains  approximately 

plSO,95  a-subunit  mRNA  and  surface  expression  (Miller  et  3.6  kb  of  additional  contiguous  upstream  flanking  DNA, 

al.,  1986).  Therefore,  24  hr  after  transfection,  PMA  was  was  similarly  assayed.  Stimulation  with  PMA  resulted  in 

added  and  the  cells  were  assayed  for  CAT  activity  48  hr  an  approximately  threefold  increase  of  CAT  activity  over 

later.  Little  or  no  CAT  activity  was  detected  even  in  the  the  background  levels  observed  in  the  absence  of  PMA 
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FIG.  6.  Functional  characterization  of  the  S'-flanking  region.  A  partial  restriction  map  of  the  S'-flanking  regioin  of  the 
plS0,9S  a-subunit  gene  (clone  8B)  is  shown:  B,  Bam  HI;  P,  Psi  I;  Sac  1.  The  size  and  location  of  the  S'-flanking  DNA 
fragments  subcloned  into  pCAT-Basic  are  shown  (not  drawn  to  scale).  The  region  of  the  S'-flanking  DNA  fused  to  the 
CAT  gene  in  each  construct  is  indicated.  The  orientation  of  intron  1  relative  to  that  in  clone  SB  is  indicated  with  an 
arrow.  The  region  of  transcription  initiation  is  indicated  by  a  wavy  arrow.  The  autoradiogram  shows  the  results  of  chro¬ 
matographic  separation  of  acetylated  products  formed  with  2S  ng  of  cell  extract  per  reaction.  The  presence  (-)-)  or  ab¬ 
sence  (-)  of  (2  ng/ml)  during  the  transfection  is  indicated.  CAT  activity  in  cells  transfected  with  pB4,3A-CAT  in 
the  presence  of  PMA  was  defined  as  lOOV*.  The  efficiency  of  electroporation  was  monitored  with  pCMV-CAT  as  con¬ 
trol.  The  transfections  were  performed  three  to  five  times,  and  representative  data  are  presented. 


(Fig.  6).  The  identification  of  a  number  of  DNA  sequence 
motifs  for  DNA-binding  protein>  within  intron  1  suggested 
that  additional  transcriptional  control  sequences  may  con¬ 
tribute  to  the  expression  of  the  pl30,9S  a-subunit  gene. 
The  effect  of  intron  1  on  CAT  activity  was  determined 
after  transfection  of  pB4,3A-CAT  into  U937.  PMA  stimu¬ 
lation  leads  to  a  20-foid  increase  in  CAT  activity  over 
background  levels  in  the  absence  of  PMA  a-.d  a  8.3-fold 
increase  over  levels  detected  in  the  absence  of  intron  1 
(compare  pB4.1-CAT  with  pB4,3A-CAT,  Fig.  6).  This  en¬ 
hanced  stimulation  of  CAT  activity  was  independent  of 
the  orientation  of  intron  1.  CAT  activity  in  U937  cells 
transfected  with  either  pB4,3A-CAT  or  pB4,3B-CAT, 
which  contains  intron  1  in  the  opposite  orientation,  was 
the  same  (Fig.  6).  The  stimulation  of  CAT  activity  by 
PMA  was  specific  to  these  CAT  fusions  as  the  cytomegalo¬ 
virus  enhancer/promoter  DNA  in  the  control  plasmid 
pCMV-CAT  was  shown  to  efficiently  drive  CAT  gene  ex¬ 
pression  in  the  absence  of  PMA. 


Determination  of  the  transcriptional  start  sitefs) 
from  the  transfected  CAT  constructs 

SI  nuclease  analysis  was  performed  on  poly(A)*RNA 
isolated  from  PMA-stimulated  U937  ceUs  transfected  with 
either  pB4,3A  CAT  or  pB4,3B  CAT  to  determine  if  tran¬ 
scription  was  correctly  initiated  from  these  constructs.  The 
5'  end-labeled  single-stranded  probes  spanned  the  746-bp 
Bom  Hl-Pt/ 1  fragment  containing  the  transcriptional  ini¬ 


tiation  sitefs)  of  the  endogenous  pi 50,95  y-subunit  gene. 
The  first  12  nucleotides  at  the  5'  end  of  probe  758  and  the 
first  14  nucleotides  at  the  5'  end  of  probe  760  (Fig.  7B)  are, 
however,  not  homologous  to  the  endogenous  gene  but  are 
instead  unique  to  the  CAT  constructs.  During  the  con¬ 
struction  of  pB4,3A  CAT  and  pB4,3B  CAT  an  additional 
18  and  41  nucleotides,  respectively,  were  introduced  adja¬ 
cent  to  the  Bam  HI  site  at  - 1 ,025  in  the  S'-flanking  region. 
The  S'  end-labeled  protected  fragments,  therefore,  corre¬ 
spond  to  RNA  initiated  from  the  CAT  constructs  only. 
Three  major  protected  fragments  were  observed  in  RNA 
from  cells  transfected  with  pB4,3A  CAT.  One  major  frag¬ 
ment  was  obs?r/ed  in  RNA  from  cells  transfected  with 
pB4,3B  CAT  (Fig.  7A,B).  The  location  of  these  transcrip¬ 
tion  initiation  sites  is  just  downstream  of  the  TATA  box  at 
-1,331. 

PCR  analysis  of  poly(A)*RNA  isolated  from  transfected 
cells  was  also  done  to  correoborate  the  SI  nuclease  analysis 
results.  A  series  of  PCR  primers  was  designed  successively 
further  upstream  of  the  transcriptional  initiation  sites  of 
the  endogenous  pi  50,95  y-subunit  gene  and  used  in  combi¬ 
nation  with  an  antisense  primer  to  the  CAT  gene  (Fig.  7C). 
Sense  primers  upstream  of  -1,295  did  not  yield  a  PCR 
product.  Sense  primers  at  -1,295  and  further  down¬ 
stream,  however,  yielded  PCR  products  of  the  expected 
sizes.  This  indicates  that  transcription  from  these  con¬ 
structs  initiates  at  or  near  -1,295.  SI  nuclease  and  PCR 
analysis,  taken  together,  strongly  suggest  that  transcription 
initiates  at  or  very  near  the  initiation  site(s)  of  the  endoge¬ 
nous  gene. 
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FIG.  7.  SI  analysis  and  PCR  analysis  to  determine  transcriptional  start  sites  from  transfected  CAT  constructs.  A. 
Poly(A)*RNA  was  isolated  from  U937  ceUs  transfected  with  pB4,3ACAT,  pB4,3BCAT,  or  no  DNA  (mock)  and  hybrid¬ 
ized  to  a  speciHc  S'  end-labeled  probe.  The  location  of  a  short  stretch  of  nucleotides  in  each  construct  not  present  in  the 
corresponding  region  of  the  endogenous  gene  is  indicated  by  a  black  box.  Probes  758  and  760  have  incoiporated  this  re¬ 
gion  (see  text).  After  SI  nuclease  digestion,  the  hybrids  were  analyzed  on  an  89*  polyacrylamide.  The  major  protected 
producu  (297,  240,  and  232  nucleotides)  resulting  from  transfection  with  pB4,3ACAT  and  the  major  protected  product 
(310  nucleotides)  resulting  from  transfection  with  pB4,3BCAT  are  shown.  The  location  of  the  undigested  probes  (not  vis¬ 
ible)  is  indicated.  Lane  M,  123-bp  ladder.  B.  Schematics  of  the  SI  nuclease  reactions  are  shown.  Probes  758  and  760 
span  the  Bam  HI  (B)  and  Pst  I  (P)  sites  at  positions  -1,025  and  -1,766,  respectively.  The  transcriptional  start  sites  de¬ 
termined  from  the  size  of  each  protected  fragment  are  shown  in  parentheses.  The  region  of  transcriptional  initiation 
(-1,371  to  -1,200)  from  the  endogenous  pl50,93  a-subunit  gene  is  shown.  Wavy  arrow  denotes  direction  of  transcrip¬ 
tion.  C.  Poly(A)*RNA  was  isolated  from  U937  cells  transfected  with  pB4,3ACAT  or  pB4,3BCAT  and  amplified  with  a 
series  of  PCR  primers.  The  antisense  primer  was  derived  from  the  5'  end  of  the  CAT  gene.  The  sense  primers  were  de¬ 
rived  from  the  5'-flanking  re^on  of  the  pl50,95  a-$ubunit  gene  (-1,627  to  -1,059).  The  size  of  the  PCR  product  ob¬ 
tained  with  each  primer  pair  is  indicated  on  the  gel  photograph  and  the  schematic. 


Effect  of  the  5' -flanking  DNA  on  the 
SV40  promoter 

The  presence  of  an  enhancer  element  within  intron  1  was 
indicated  since  the  stimulation  of  CAT  activity  from 
pB4,3A-CAT  and  pB4,3B-CAT  was  independent  of  the 
(mentation  of  intron  1.  To  study  the  effect  of  intron  I  on  a 


heterologous  promoter,  the  1.1 -kb  Bam  Hl-ftt  I  fragment 
containing  this  intron  was  subcloned  approximately  1.6  kb 
upstream  of  the  SV40  promoter-CAT  fusion  in  the  pCAT- 
promoter  plasmid.  Since  additional  sequences  upstream  of 
intron  1  were  also  found  to  be  essential  for  transcription  of 
the  pl50,95  a-subunit  gene  (results  of  transfection  with 
pB4.I-CAT),  it  was  of  interest  to  determine  if  the  SV40 
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promoter  could  also  be  stimulated  by  DNA  from  this  re¬ 
gion.  Therefore,  the  4.4-kb  Bam  HI  fragment  upstream  of 
intron  I  in  clone  8B  was  also  subcloned  into  the  pCAT- 
promoter  plasmid.  The  two  resulting  constructs,  pSV- 
CAT-l  (containing  intron  1)  and  pSV-CAT-2  (containing 
the  4.4-kb  Bam  HI  fragment)  were  transfected  into  HeLa 
cells.  As  controls,  the  pCAT-promoter  plasmid  and  the 
pCAT-control  plasmid  (which  contains  an  SV40  enhancer 
and  promoter,  Promega,  Madison,  WI)  were  also  trans¬ 
fected.  The  pCAT-promoter  plasmid  was  inactive  in  HeLa 
cells,  however,  both  pSV-CAT-l  and  pSV-CAT-2  ex¬ 
pressed  high  levels  of  CAT  activity  (Fig.  8).  The  stimula¬ 
tion  of  CAT  activity  by  the  4.4-kb  Bam  HI  fragment  in 
pSV-CAT-2  is  equivalent  to  that  of  the  SV40  enhancer  in 
the  pCAT-control  plasmid.  Additionally,  the  pSV-CAT-2 
plasmid  expressed  5-  to  10-fold  more  CAT  activity  than 
pSV-CAT-1.  However,  the  expression  of  the  CAT  gene 
was  not  increased  by  PMA  with  either  construct.  These  re¬ 
sults  indicated  that  one  or  more  enhancers  were  present 
within  the  5'-flanking  DNA  of  the  plS0,9S  a-subunit  gene 
and  that  the  cellular  and  promoter  specificities  of  these  en¬ 
hancers  are  not  limited  to  U937  cells  and  the  pi 50,95  a- 
subunit  promoter,  respectively.  Alternatively,  the  5'-nank- 
ing  DNA  may  encode  trans-acting  factors. 

Analysis  of  the  3' -flanking  sequence 

Exon  31  contains  the  last  102  nucleotides  of  the  trans¬ 
lated  sequence  before  the  TGA  stop  codon  and  extends 
into  the  3'  untranslated  DNA.  To  see  if  other  introns  are 
present,  the  entire  untranslated  region  was  sequenced.  A 
comparison  of  this  1.5-kb  sequence  with  the  published  3' 
untranslated  region  contained  in  the  cDNA  sequence  re¬ 
vealed  a  number  of  base  changes  and  additions.  However, 
no  additional  introns  were  present.  The  polyadenylation 
signal  sequence,  AATAAA,  is  found  at  the  3'  end  (Fig.  9). 
Within  the  178  nucleotides  downstream  of  the  AATAAA 
sequence  are  three  sequences  that  show  homology  to  previ¬ 
ously  identified  3'  sequence  elements.  The  sequence 
CACTG  is  homologous  to  the  sequence  CAYTG  (Gil  and 
Proudfast,  1987)  found  3'  of  the  AATAAA  site  of  many 
vertebrate  genes.  The  sequence  TTGTGAATAT  resembles 
another  consensus  motif,  TTG/ANNNTTTTTT  (Bimstiel 
et  al.,  1985),  which  also  is  present  in  the  region  immedi¬ 
ately  beyond  the  polyadenylation  signal  of  many  genes. 
Three  nucleotides  downstream  of  this  second  sequence  is 
found  a  third  sequence,  TCTCTTGGAA,  which  is  similar 
to  the  GT-rich  homology  TGTGTTGGAA  present  in  the 
interferon-7  Sene  (Taya  el  al. ,  1982). 


DISCUSSION 

In  this  paper  we  have  described  the  isolation  and  charac¬ 
terization  of  a  genomic  clone  for  the  pi 50,95  a-$ubunit 
gene.  Our  studies  characterizing  the  exon-intron  organiza¬ 
tion  of  this  gene  are  in  general  agreement  with  a  previous 
report  by  Corbi  et  al.  (1990),  with  a  few  exceptions  (see  be¬ 
low).  We  provided  further  characterization  of  this  gene 
with  regard  to  detailed  molecular  analysis  of  the  5'  and 
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FIG.  8.  Effect  of  the  5'-fIanking  DNA  on  the  transcrip¬ 
tional  activity  of  the  SV40  promoter.  HeLa  cells  were 
transfected  with  the  indicated  plasmid  constructs  in  the 
presence  or  absence  of  PMA  (2  ng/ml).  Approximately  50 
fig  of  cell  extract  was  assayed  per  reaction  and  CAT  activ¬ 
ity  in  cells  transfected  with  pCAT-control  (which  contains 
both  the  SV40  promoter  and  enhancer)  in  the  presence  of 
PMA  was  defined  as  lOO^o. 


AASjxATA^AACCCTTCTCTCCATATCACTCCTCAACCA 

TCACACCCATCCCCTCCCTTTACCACCCTCCACCCTCGCC 

ACCACTCACCCCACCCCCCACCACAACAACTTAACCACAC 

TACCCTTCTCAATATCCCTCTCTTCCAACTAACCTCACCT 

CTTAATCTCCTCACTCCACTCTCAGCG 

FIG.  9.  Nucleotide  sequence  of  the  3'-nanking  region. 
The  putative  polyadenylation  signal  is  boxed.  Sequence 
motifs  present  in  the  3'  regions  of  other  genes  are  under¬ 
lined. 


3’-flanking  regions.  In  agreement  with  the  above  authors, 
we  found  that  the  31  exons  are  distributed  over  approxi¬ 
mately  30  kb  of  chromosomal  DNA.  The  exons  are  clus¬ 
tered.  into  five  groups  that  correspond  to  the  domains  of 
the  plS0,9S  protein.  The  signal  peptide  coding  region  is 
split  between  exon  2  and  exon  3.  The  inserted  domain  or 
“I”  domain  is  contained  within  exons  7,  8,  9,  and  10.  This 
domain  is  speculated  to  be  important  in  the  binding  of 
ligands  (Corbi  et  al.,  1987).  The  integrin  receptors  require 
Ca'*  or  Mg**  to  bind  ligands.  The  pl50,95  or-subunit  se- 
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quence  contains  three  repeats  that  are  similar  to  the  “EF- 
hand"  loop  structure  found  in  other  Ca**-binding  proteins. 
The  three  metal-binding  domains  are  found  in  separate 
exons  (exons  14,  15,  16).  The  transmembrane  is  split  be¬ 
tween  exons  30  and  31.  Exon  30  contains  22  of  the  26 
amino  acids  of  the  transmembrane  domain.  The  cytoplas¬ 
mic  domain  is  contained  in  exon  31.  However,  we  found 
one  major  difference  and  a  few  minor  ones  over  those  of 
Corbi  et  at. .  The  size  of  intron  30  in  our  clone  is  greater 
than  I  kb,  whereas  they  reported  an  intron  size  of  only 
about  250  bp.  Additionally,  the  sizes  of  several  of  the 
smaller  introns  (e.g.,  introns  11,  23,  25,  26,  28)  were  in 
general  shorter  by  comparison  to  those  previously  re¬ 
ported.  Several  single  base  changes  were  found  in  the  in- 
tronic  5'  or  3'  consensus  sequences  (not  indicated);  how¬ 
ever,  all  junctions  were  consistent  with  the  gt-ag  rule  for 
splice  donor  and  acceptor  sites. 

The  I  domain  shares  homology  to  regions  found  in  the 
von  Willebrand  factor  (vWF)  gene  (Mancuso  et  at.,  1989), 
the  cartilage  matrix  protein  (CMP)  gene  (Kiss  el  at.,  1989), 
the  complement  factor  B  gene  (Campbell  and  Porter. 
1983),  and  the  type  IV  collagen  a  polypeptides,  a2  and  a3 
(Saitta  et  at.,  1990;  Stokes  et  at.,  1991).  The  exon-intron 
arrangement  of  the  pi 50,95  a-subunit  gene  is  similar  to 
that  of  the  vWF  domain  A3  in  that  four  exons  of  approxi¬ 
mately  equal  sizes  encode  each  of  these  gene’s  domains.  In 
contrast,  the  two  repetitive  domains  found  in  the  CMP 
gene  and  the  domains  in  the  a2(VI)  and  a3(VI)  genes  are 
each  encoded  by  two  exons  of  dissimilar  sizes.  As  is  the 
case  with  vWF  domain  A  and  the  CMP-1  and  CMP-2  do¬ 
mains,  the  introns  flanking  the  1  dommn  are  type  1 .  Exons 
that  have  flanking  introns  of  the  same  phase  class  are  theo¬ 
rized  to  have  arisen  by  gene  segment  duplication  or  exon 
shuffling.  The  functional  significance  of  the  homology  of 
the  I  domain  with  these  other  domains  is  unknown.  Since 
the  vWF  A  domain  can  bind  a  number  of  ligands,  the  I  do¬ 
main  conceivably  may  also  interact  with  similar  molecules. 

Since  the  3'  untranslated  sequence  was  not  previously  re¬ 
ported.  the  1.5-kb  DNA  sequence  immediately  following 
the  TCA  translational  stop  codon  was  completely  se¬ 
quenced.  A  comparison  of  the  3'  untranslated  sequence  of 
the  cDNA  sequence  (Corbi  el  at.,  1987)  was  made  with  the 
3'  sequence  of  our  genomic  clone.  Our  genomic  clone  has 
an  additional  47  nucleotides  between  the  translational  stop 
codon  and  the  putative  polyadenylation  site  was  compared 
to  the  cDNA  sequence;  however,  they  are  distributed 
throughout  this  region.  Therefore,  no  additional  introns 
are  present  and  the  size  of  exon  31  is  at  least  1,247  nucleo¬ 
tides. 

In  further  studies  the  2,163-bp  re^on  preceding  the 
translational  start  codon  was  completely  sequenced.  The 
presence  of  an  additional  intron  within  the  untranslated  re¬ 
gion  was  indicated  after  comparing  the  cDNA  sequence 
with  our  genomic  sequence.  The  two  sequences  diverge  52 
nucleotides  upstream  of  the  ATG  start  codon  and  a  3' 
splice  sequence  is  found  at  this  location.  However,  65  nu¬ 
cleotides  upstream  of  the  ATG  codon  is  found  a  second  3' 
splice  sequence  that  more  precisely  matches  the  consensus 
sequence  for  a  3'  splice  acceptor  site.  SI  nuclease  analysis 
for  this  region  confirmed  the  presence  of  an  intron.  More¬ 


over,  the  size  of  the  protected  fragments  (71  and  88  nu¬ 
cleotides)  suggested  that  both  3'  splice  sites  might  be  func¬ 
tional.  The  abundance  of  the  88-nucleotide-long  fragment 
was  over  10  times  greater  than  the  71 -nucleotide-long  frag¬ 
ment,  which  indicates  that  the  second  splice  site  is  the  pre¬ 
ferred  site.  A  S'  splice  sequence  (at  position  -637)  and  a 
second  putative  5'  splice  sequence  is  present  94  nucleotides 
further  downstream  (at  position  -563).  However,  primer 
extension  analysis  indicated  that  the  5'  splice  sequence  at 
position  -657  is  the  correct  site. 

Primer  extension  analysis  was  used  to  determine  the  ap¬ 
proximate  transcriptional  start  site.  Although  there  was 
heterogeneity  in  the  sizes  of  the  extended  products,  most 
transcripts  were  extended  to  within  a  100-  to  200-nucleo¬ 
tide  region  approximately  1,200  nucleotides  upstream  of 
the  ATG  translational  start  sites.  The  5'-flanking  sequence 
was  searched  for  sequences  recognized  by  DNA-binding 
proteins.  Two  putative  TATA  boxes  (Matsui  et  at.,  1980) 
are  found  within  the  region  assigned  to  the  transcriptional 
initiation  site.  Additionally,  two  putative  GC  boxes 
(Dynan  and  Tjian,  1983)  are  found  flanking  the  two 
TATA  boxes.  It  has  not  been  determined  from  these  stud¬ 
ies  whether  the  TATA  boxes  that  are  present  upstream  of 
the  most  S'  transcriptional  initiation  sites  are  functional. 
The  746-bp  region  from  -1,020  to  -1,766  that  contains 
these  sequence  motifs  could  not  alone  drive  transcription 
of  the  CAT  gene.  The  inclusion  of  an  additional  3.6  kb  of 
upstream  DNA  in  the  pB4. 1-CAT  construct  resulted  in  a 
marked  increase  in  expression  of  CAT  activity.  Additional 
5'  sequences,  thus,  appear  to  be  essential  for  expression  al¬ 
though  the  location  of  these  sequences  within  this  large 
stretch  of  DNA  was  not  determined.  The  most  dramatic 
increase  in  CAT  expression  resulted  by  including  the  re¬ 
gion  that  contains  intron  1  in  these  CAT  fusions.  The  stim¬ 
ulation  of  CAT  activity  was  independent  of  the  orientation 
of  the  intron  1 -containing  fragment  indicating  that  an  en¬ 
hancer  was  present.  Regulatory  elements  within  the  first 
intron  of  a  gene  have  been  also  found  in  the  human  4F2 
heavy-chain  gene  (Karpinsky  et  at.,  1989)  and  the  Droso¬ 
phila  ff3  tubulin  gene  (Bruhat  et  at.,  1990).  Alternatively, 
this  region  may  contain  a  gene(s)  for  a  rrons-aaivating  fac¬ 
tor,  as  a  number  of  ATG  codons  were  revealed,  although 
the  sequence  for  such  a  factor  would  have  to  be  short  be¬ 
cause  a  number  of  translational  stop  codons  are  also  scat¬ 
tered  throughout  this  region.  The  orientation-independent 
enhancement  of  CAT  activity  by  this  region  indicates  that 
the  promoter  for  the  pi 50,95  a-subunit  gene  is  not  within 
this  region.  On  the  contrary,  the  orientation  of  the  4.4-kb 
Bam  HI  fragment  in  pB4.1-CAT  is  critical  because  the  re¬ 
verse  orientation  markedly  decreases  CAT  activity  thus  in¬ 
dicating  the  presence  of  a  promoter  (data  not  shown).  Ac¬ 
tivation  of  the  SV40  promoter  by  sequences  within  the 
746-bp  Bam  Hl-Psr  I  fragment  and  the  4.4-kb  Bam  HI 
fragment  further  demonstrates  the  presence  of  cis-acting 
regulatory  regions.  Interestingly,  although  PMA  was  re¬ 
quired  for  activation  by  these  DNA  regions  in  U937  cells, 
these  sequences  were  functional  in  HeLa  cells  in  the  ab¬ 
sence  of  PMA. 

The  5'-flanking  region  of  the  pi  50,95  a-subunit  gene 
shares  some  similarities  with  the  regulatory  region  of  the 
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a4  integrin  gene  (Rosen  el  ai,  1991).  The  S'  untranslated 
regions  of  both  genes  are  unusually  long  -  approximately 
714-777  nucleotides  for  pi 50,95  and  743  nucleotides  for 
op4.  The  5'-fIanking  region  of  the  a4  gene,  however,  does 
not  contain  an  intron.  ATG  codons  precede  the  transla¬ 
tional  initiation  codon  in  the  pi 50,95  a-subunit  gene.  In¬ 
terestingly,  translation  of  the  a4  gene  begins  at  the  second 
ATG  in  exon  1.  Such  upstream  ATG  codons  are  also 
found  in  the  long  S'  untranslated  regions  of  proto-onco¬ 
genes  (Kozak,  1987)  where  they  may  be  involved  in  transla¬ 
tional  control.  Both  AP-I  and  AP-2  sites  are  also  found  in 
the  a4  gene;  however,  unlike  the  arrangement  with  pi 50,95 
a-subunit  gene,  they  are  separated  by  %1  nucleotides. 
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